Adaptive importance sampling for control and inference

نویسنده

  • Hilbert J. Kappen
چکیده

Path integral (PI) control problems are a restricted class of non-linear control problems that can be solved formally as a Feynman–Kac PI and can be estimated using Monte Carlo sampling. In this contribution we review PI control theory in the finite horizon case. We subsequently focus on the problem how to compute and represent control solutions. We review the most commonly used methods in robotics and control. Within the PI theory, the question of how to compute becomes the question of importance sampling. Efficient importance samplers are state feedback controllers and the use of these requires an efficient representation. Learning and representing effective state-feedback controllers for non-linear stochastic control problems is a very challenging, and largely unsolved, problem. We show how to learn and represent such controllers using ideas from the cross entropy method. We derive a gradient descent method that allows to learn feed-back controllers using an arbitrary parametrisation. We refer to this method as the path integral cross entropy method or PICE. We illustrate this method for some simple examples. The PI control methods can be used to estimate the posterior distribution in latent state models. In neuroscience these problems arise when estimating connectivity from neural recording data using EM. We demonstrate the PI control method as an accurate alternative to particle filtering.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive Importance Sampling for Bayesian Inference in Gaussian Process Models

Gaussian process (GP) models are nowadays considered among the standard tools in modern control system engineering. They are routinely used for model-based control, timeseries prediction, modelling and estimation in engineering applications. While the underlying theory is completely in line with the principles of Bayesian inference, in practice this property is lost due to approximation steps i...

متن کامل

Prediction of Weld Strength in Resistance Spot Welded Samples by Adaptive Neuro-Fuzzy Inference System (ANFIS)

Resistance Spot Welding (RSW) is one of the effective manufacturing processes used widely for joining sheet metals. Prediction of weld strength of welded samples has great importance in manufacturing and different methods are used by researchers to find the fracture force. In this article, the Adaptive Neuro-Fuzzy Inference System (ANFIS) is utilized for prediction of joint strength in welded s...

متن کامل

Long-term Streamflow Forecasting by Adaptive Neuro-Fuzzy Inference System Using K-fold Cross-validation: (Case Study: Taleghan Basin, Iran)

Streamflow forecasting has an important role in water resource management (e.g. flood control, drought management, reservoir design, etc.). In this paper, the application of Adaptive Neuro Fuzzy Inference System (ANFIS) is used for long-term streamflow forecasting (monthly, seasonal) and moreover, cross-validation method (K-fold) is investigated to evaluate test-training data in the model.Then,...

متن کامل

Prediction of Weld Strength in Resistance Spot Welded Samples by Adaptive Neuro-Fuzzy Inference System (ANFIS)

Resistance Spot Welding (RSW) is one of the effective manufacturing processes used widely for joining sheet metals. Prediction of weld strength of welded samples has great importance in manufacturing and different methods are used by researchers to find the fracture force. In this article, the Adaptive Neuro-Fuzzy Inference System (ANFIS) is utilized for prediction of joint strength in welded s...

متن کامل

Optimal Control of Network Structure Growth

We formulate the problem of influencing the growth of a network as a stochastic optimal control problem in which a structural cost function penalizes undesired topologies. We approximate this control problem with a restricted class of control problems that can be solved using probabilistic inference methods. To deal with the increasing problem dimensionality, we introduce an adaptive importance...

متن کامل

Controlling structures by inverse adaptive neuro fuzzy inference system and MR dampers

To control structures against wind and earthquake excitations, Adaptive Neuro Fuzzy Inference Systems and Neural Networks are combined in this study. The control scheme consists of an ANFIS inverse model of the structure to assess the control force. Considering existing ANFIS controllers, which require a second controller to generate training data, the authors’ approach does not need anot...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1505.01874  شماره 

صفحات  -

تاریخ انتشار 2015